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on small scales not amenable to measurements by any other means. Also, it can be a unique 
probe of the heating process and the thermal history of the early universe, as the signal is strongly 
dependent on the IGM temperature. Here we show what SKA 1-low could do in terms of detecting 
the 21cm forest in the redshift range z ~ 7.5 — 15. 
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1. Introduction 


An alternative to both the tomography teehnique and the power speetrum approaeh is to seareh 
for the 21em forest, that is the 21em absorption features against high-z radio loud sourees eaused 
by the intervening eold neutral IGM and eollapsed struetures (e.g. Carilli, Gnedin & Owen 2002; 
Furlanetto & Loeb 2002; Furlanetto 2006; Carilli et al. 2007; Xu et al. 2009; Xu, Ferrara & Chen 
2011; Maek & Wyithe 2012; Ciardi et al. 2013; Ewall-Wiee et al. 2014). In faet the 21em forest is 
more than a eomplement to tomography or power speetrum analysis. Sinee the strongest absorption 
features arise from small seale struetures, the 21em forest ean probe the HI density power speetrum 
on small seales not amenable to measurements by any other means (e.g. Shimabukuro et al. 2014). 
Also, it ean be a unique probe of the heating proeess and the thermal history of the early universe, 
as the signal is strongly dependent on the IGM temperature. 

The photons emitted by a radio loud souree at redshift with frequeneies V > V2icm> will be 
removed from the souree speetrum with a probability 1 — absorbed by the neutral hydrogen 

present along the line of sight (LOS) at redshift z = V2icm/v(l +Zs) — 1- The optieal depth T2icm 
ean be written as (e.g. Madau, Meiksin & Rees 1997; Furlanetto, Oh & Briggs 2006): 

3 hpC^A2i cm 


'f21cm(^) — 


( 1 . 1 ) 


3271 r,(l+z)(<iv||/(7r||)’ 

where tih is the H number density, xhi is the mean neutral hydrogen fraetion, Tg is the gas spin 
temperature (whieh quantifies the relative population of the two levels of the ^51/2 transition), 
^21cm = 2.85 X 10 s Ms the Einstein eoeffieient of the transition and dv\\/dr\\ is the gradient 
of the proper veloeity along the LOS (in km s^^), whieh takes into aeeount also the eontribution 
of the gas peeuliar veloeity. The other symbols appearing in the equation above have the standard 
meaning adopted in the literature. 

Analogously to the ease of the Ly-a forest, this eould result in an average suppression of 
the souree flux (produeed by diffuse neutral hydrogen), as well as in a series of isolated absorption 
lines (produeed by overdense elumps of neutral hydrogen), with the strongest absorption assoeiated 
with high density, neutral and eold patehes of gas. This suggests that the absorption features due to 
eollapsed struetures with no or very small star formation (to maximize the amount of HI available 
for absorption), sueh as minihalos or dwarf galaxies (Eurlanetto & Loeb 2002; Meiksin 2011; 
Xu, Lerrara & Chen 2011) would be easier to deteet than those due to the diffuse neutral IGM. 
However, this does strongly depend on the feedbaek effeets aeting on sueh objeets. Beeause of the 
large uneertainties in the nature and intensity of high-z feedbaek effeets (for a review see Ciardi & 
Lerrara 2005 and its updated version arXiv:astro-ph/0409018), it is not straightforward to estimate 
the relative importanee of the absorption signals from the diffuse IGM and from eollapsed objeets. 

While gas whieh has been (even only partially) ionized has a temperature of lO'^ K, gas 
whieh has not been reaehed by ionizing photons has a temperature whieh ean be even lower that of 
the CMB. This neutral gas ean be heated by Ly-a or X-ray photons, thus redueing the optieal depth 
to 21em. While Ly-a heating is not extremely effieient, heating due to X-ray photons eould easily 
suppress the otherwise present absorption features (e.g. Maek & Wyithe 2012; Ciardi et al. 2013). 
This seems to suggest that with observations of the 21em forest it would be possible to discriminate 
between different IGM reheating histories, in particular if a high energy component in the ionising 
spectrum were present (Ewall-Wice et al. 2013). 
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2. Observed spectra 


In this Section we describe the process used to simulate an observed spectrum. The simulated 
absorption spectrum, Sabs, is calculated from a full 3D radiative transfer simulation of IGM reion¬ 
ization which resolves scales of ~ 1 kHz (corresponding to ~50 kpc comoving; Ciardi et al. 2012; 
Ciardi et al. 2013). In the simulation used here, the contribution to ionization and heating from 
x-rays and/or Ly-a photons is not included (although see Ciardi et al. 2013 for examples of spectra 
which include such effects), i.e. the gas which is not reached by UV ionizing photons remains cold. 

The instrumental effects are estimated using the pipeline of the LOFAR telescope. More 
specifically, the equation giving the observed visibilities is: 


K sourc es 

Fv(u)= ^ (2.1) 

i 

where u = (m,v,w) are the coordinates of a given baseline at a certain time t, ly is the observed 
source intensity, s = {I,m,n) is a vector representing the direction cosines for a given source direc¬ 
tion and ris represents additive noise. The noise is given by the radiometer equation: 


1 SEFD 

T], \/2tintAv ’ 


( 2 . 2 ) 


where rjs is the system efficiency, Av is the bandwidth and is the integration time. Because of 
our experience with the LOFAR telescope, we will make predictions based on its characteristics, 
and we will then scale the noise to match the one expected from SKA. We thus assume that rj^ = 0.5 
and Nst = 48. The system equivalent flux density is given by: 


SEFD = 


2 Kb 2sys 
AdipflaAeff 


(2.3) 


where Kb is Boltzmann’s constant, Agff = 1.5626) is the effective area of each dipole in the 

dense and sparse array regimes respectively, Adip is the number of dipoles per station (24 tiles times 
16 dipoles per tile for a LOFAR core station) and rja is the dipole efficiency which we assume to 
be 1. The system noise Tgys has two contributions: (i) from the electronics and (ii) from the sky. We 
assume that the sky has a spectral index of -2.55, obtaining T^ys = [140-1- 60(v/150 MHz)^^'^^] K. 
The complete Fourier plane sampling can be done by evaluating the above equation for every set of 
baseline coordinates. The predicted visibilities are then gridded and transformed via inverse Fourier 
transforms in order to obtain the dirty images. For the purpose of the 21 cm forest, fine spectral 
resolution (~ 1 kHz, see later for further discussion) is needed, which means that the spectra need 
to be predicted for a large number of channels (~ 10000). After assembling the full image cube, 
the LOS spectrum is extracted. In principle, the effect of the Point Spread Function (PSF) side 
lobes running through the source of interest can be taken into account by including more sources 
at different positions, with or without absorption features. Here, we have ignored the effect of 
side lobe noise. In fact, as the SKA will have a very dense uv coverage and the expected 21cm 
absorption lines will be narrower than a few tens of kHz, we expect the side lobe noise to play a 
marginal role. 

Figure shows the 21cm absorption spectrum due to the diffuse IGM along a random LOS 
for a bright radio source at z = 10 (i.e. v ~ 129 MHz). For an easy comparison to existing work 
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Figure 1: Upper panels: Spectrum of a radio source positioned atz = 10(v^ 129 MHz), with a power-law 
index a = 1.05 and a flux density J = 50 mJy (left hand panels) and 10 mJy (right hand panel). The red 
dotted lines refer to the instrinsic spectrum of the radio source, Sm, the blue dashed lines to the simulated 
spectrum for 21cm absorption, 5abs (in a universe where neutral regions remain cold); and the black solid 
lines to the spectrum for 21cm absorption as it would be seen with an observation time fjnt = 1000 h and a 
frequency resolution Av = 10 kHz. The first panel to the left corresponds to a case with the LOFAR noise, 
while the other two panels have 1/lOth of the LOFAR noise, roughly expected for SKAl-low. Lower panels: 
The ratio (Jabs/Cobs corresponding to the upper panels. 


on the 21cm forest (e.g. Carilli, Gnedin & Owen 2002; Mack & Wyithe 2012; Ciardi et al. 2013), 
the intrinsic radio source spectrum, S'in, is assumed to be similar to Cygnus A, with a power- 
law with index a = 1.05 and a flux density 7 = 50 mJy and 10 mJy. The simulated absorption 
spectrum, S'abs. is calculated from the simulations mentioned above. The observed spectrum, Sobs^ 
is calculated assuming an observation time hnt = 1000 h with the LOFAR and SKAl-low telescopes 
and a bandwidth Av = 10 kHz. A clear absorption signal is observed. This is more evident in the 
lower panels of Figure [I], which show the quantity CJabs/o'obs^ where a, = Si — Sjn and /=abs, obs. 
As already mentioned above, the inclusion of Ly-a or x-ray heating could suppress or reduce the 
absoprtion features, with the extent of the effect being highly dependent on the source model (see 
e.g. Mack & Wyithe 2012; Ciardi et al. 2013). 

Very strong absorption features could be easily detected also at lower redshift, when most of 
the IGM is in a highly ionization state, if we were lucky enough to intercept high density cold 
pockets of gas (with T 2 icm >0.1; these cells are found in ~ 0.1% of the LOS in the simulation), as 
shown in Figure 

Moving towards higher redshift, when most of the gas in the IGM is still neutral and relatively 
cold, would offer the chance of detecting a stronger average absorption (rather than the single 
absorption features observed at lower redshift). If a radio source with characteristics similar to the 
ones described above were found, SKAl-low would easily detect the global absorption as shown 
in Figure although it would not be straightforward to distinguish whether the suppression of the 
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Figure 2: Upper panels: Spectrum of a radio source positioned at z = 7.6 (v ^ 165 MHz), with a power-law 
index a = 1.05 and a flux density J = 50 mJy (left hand panels) and 10 mJy (right hand panel). The red 
dotted lines refer to the instrinsic spectrum of the radio source, Sm, the blue dashed lines to the simulated 
spectrum for 21cm absorption, 5abs (in a universe where neutral regions remain cold); and the black solid 
lines to the spectrum for 21cm absorption as it would be seen with an observation time fjnt = 1000 h and a 
frequency resolution Av = 5 kHz. The first panel to the left corresponds to a case with the LOFAR noise, 
while the other two panels have 1/lOth of the LOFAR noise, roughly expected for SKAl-low. Lower panels: 
The ratio (Jabs/Cobs corresponding to the upper panels. 


source flux were due to the intervining neutral IGM or an intrinsically lower flux. 


3. Challenges 


The most challenging aspect of the detection of a 21cm forest remains the existence of high-z 
radio loud sources. Although a QSO has been detected at z = 7.085 (Mortlock et al. 2011), it is 
radio quiet (Momjian et al. 2014), and the existence of even higher redshift quasars is uncertain. 
To observe the absorption features in the spectrum, this has to be observed with a certain precision, 
which depends on the brightness of the quasar and the sensitivity of the instruments. The minimum 
detectable flux density of an interferometer can be written as: 


^S 


min 


2KBT,y, s 

A^J^Vtint N' 


(3.1) 


where A is the collecting area of the array and S'/A is the signal-to-noise ratio. As one may not 
be able to distinguish whether the flux decrement were due to the diffuse IGM or an intrinsically 
lower flux, we will probably only detect the additional absorptions with respect to the absorption 
by the IGM. Therefore, the minimum flux density of the background source required to observe 
the absorption lines is: 


10.3mJy 
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Figure 3: Upper panels: Spectrum of a radio source positioned at z = 14 (v ^ 95 MHz), with a power-law 
index a = 1.05 and a flux density J = 50 mJy (left hand panels) and 10 mJy (right hand panel). The red 
dotted lines refer to the instrinsic spectrum of the radio source, Sm, the blue dashed lines to the simulated 
spectrum for 21cm absorption, 5abs (in a universe where neutral regions remain cold); and the black solid 
lines to the spectrum for 21cm absorption as it would be seen with an observation time fjnt = 1000 h and a 
frequency resolution Av = 20 kHz. The first panel to the left corresponds to a case with the LOFAR noise, 
while the other two panels have 1/lOth of the LOFAR noise, roughly expected for SKAl-low. Lower panels: 
The ratio (Jabs/Cobs corresponding to the upper panels. 


The SKAl-low will have A/T^ys ~ 1000m^K^\ and we expect A/T^ys ~ 4000m^K^' for SKA2- 
low. 

Extrapolating the observed number density of radio sources at z = 4 (Jarvis et al. 2001) to 
higher redshift and lower luminosity, one can calculate the number of quasars with flux density at 
the observed frequency Vobs = 1420.4/(1 -|-z) MHz larger than the lower limit described above, for 
a flat evolution model and a steep evolution model respectively (Xu et al. 2009). Using the planned 
SKA sensitivities, the predicted numbers of qualified radio sources are plotted in Fig.^ 

The predicted number of radio sources which can be used for 21cm forest studies in the whole 
sky per unit redshift at z = 10 varies in the range 8 x 10^ — 2 x 10^. This expected number, which 
is based on observations at redshift lower than reionization, is very uncertain. The exact number 
depends on the model adopted for the luminosity function of such sources and the instrumental 
characteristics (e.g. Carilli, Gnedin & Owen 2002; Xu et al. 2009), making such a detection an 
extremely challenging task. As a reference, the estimated quasar number per unit redshift at z = 10 
for the 3-tiered survey is reported in Table |I|. 

If a sufficiently bright radio loud quasar were found beyond the redshift of reionization, then 
the absorption lines generated from early non-linear structures could be easily detected, as the 
spectral resolution of the SKA will not be a problem. Assuming the minimum halo mass hosting 
cold neutral gas to be IO^Mq, about one absorption line in every 8.4kHz is expected at redshift 
z ~ 10. The absorption line density is lower for lower redshift and higher minimum mass. On 
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Figure 4: The number of quasars in the whole sky that can be used to detect signals with _ g-r ^ q q ^ 
per redshift interval. The sensitivity of the radio array is taken tobe A/Tsys = 1000 (black solid lines) 

and 4000 (blue dashed). The integration time is assumed to be 1000 hours. Left panel: the number 

of qualified quasars in the flat evolution model. Right panel: the number of qualified quasars in the steep 
evolution model. 
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1000 
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57 

Flat 
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1000 

4000 
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Steep 

1000 

100 
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10000 

10 

1000 

10 

Steep 

100 

1000 

4000 

25 

Steep 

1000 

100 

4000 

63 

Steep 

10000 

10 

4000 


Table 1: Estimated quasar number at z = 10 for the 3-tiered survey. The columns refer to; number per unit 
redshift, source model, survey area, observation time, A/Tsyg. 
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the other hand, the line width from non-linear structures ranges mostly from ~ IkHz to ~ 5kHz. 
Given that the SKAl-low will have a spectral resolution of IkHz, the line counting is feasible as 
long as sufficiently bright radio sources are available at high redshift. Especially, if one stacks 
together several lines to get an average profile, if will hopefully reveal fhe physical sfafus of fhe 
early non-linear sfrucfures. 

An alfemafive possibilify is fhe radio aflerglows of cerfain types of gamma-ray bursfs (GRBs). 
GRBs have already been observed up fo z ~ 8 — 9, and if is plausible fhaf fhey occur up fo fhe earliesf 
epochs of sfar formafion in fhe universe at z ~ 20 or higher. In addition, they have a simple power- 
law spectrum at low frequencies, making the signal extraction easy. However, if such GRBs are 
similar to those seen at lower redshifts, their radio afterglows are not expected to be bright enough 
at the relevant observed frequencies Vobs ^ 100 MHz due to strong synchrotron self-absorption 
(loka & Meszaros 2005; Inoue, Omukai & Ciardi 2007). On the other hand, it has been recently 
proposed that GRBs arising from Population (Pop) III stars forming in metal-free or very metal 
poor environments may be much more energetic compared to ordinary GRBs, leading to blastwaves 
expanding to much larger radii, and consequently much brighter low-frequency radio afterglows, 
exceeding tens of mJy (Toma, Sakamoto & Meszaros 2011). This occurs over timescales of up to ~ 
1000 yr, making them virtually steady radio sources, differently from more standard, lower redshift 
GRBs which, on the other hand, offer relatively short integration times and a limited spectral length 
between the location of the GRB and the end of reionization (e.g. Xu et al. 2011). In Figure ^ we 
show an example of absorption spectra in high-z GRBs, from both Pop III and Pop II stars (Ciardi 
et ah, in prep). 

If the rate of Pop III GRBs with sufficiently bright radio emission is 0.1 yr^^ or roughly 10^^ 
of all GRBs, one may expect ~ 100 such sources all sky at a given time. A practical question 
that remains is how we can observationally identify such sources. One possibility of using GRB 
afterglows as the background radio source is the broad-band observation, measuring the mean flux 
decremenf in each band wifhouf resolving individual absorpfion lines (Xu el al. 2011). 

4. Discussion and Summary 

As explained above, absorpfion fealures due lo small collapsed objecfs can be much sfronger 
fhan fhose due fo fhe diffuse neulral IGM. Since Iheir cross-secfions are small, fhe besl condifions 
for delecfing Ihem would be when Ly-a coupling pushes fhe spin femperafure in Iheir lower density 
oufskirfs lo fhe gas femperafure before Ihese regions have been affecled by any healing (see Fig. 22 
in Meiksin 2011), condifions expecled above z ~ 10. However, even after healing has sfarled lo 
suppress fhe 21cm absorpfion signal, some weak fealures due lo collapsed sfrucfures may remain. 
Inleresfingly, even when if may nol be possible fo delecl Ihese weak fealures individually, fhe 
presence of absorpfion may be delected statistically. Weak absorption would produce an increase 
in the variance of brightness fluctuations, as an addition to the telescope noise, resulting in an 
apparently noisier spectrum blue-ward of the 21cm transition (see Fig. 29 in Meiksin 2011; Fig. 8 
in Mack & Wyithe 2012). The possibility to have a statistical detection of the 21cm forest (rather 
than the detection of single absorption features), is intriguing as the signature of the forest would be 
observed in the 21cm power spectrum (see also Ewall-Wice et al. 2013). By integrating the signal 
from many high redshift sources within the field of view, would reduce fhe sensifivily requiremenfs 
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Figure 5: Upper panels: Spectrum of a GRB positioned at z = 10 (v ^ 129 MHz), with a power-law index 
a — 0.5 and a flux density J — 30 mJy (GRB from Pop III stars; left hand panel) and 0.1 mJy (GRB from 
Pop II stars; right hand panel). The red dotted lines refer to the instrinsic spectrum of the source, Sin', the blue 
dashed lines to the simulated spectrum for 21cm absorption, 5abs (in a universe where neutral regions remain 
cold); and the black solid lines to the spectrum for 21cm absorption as it would be seen with an observation 
time tint = 1000 h and a frequency resolution Av = 10 kHz. The panel to the left (right) corresponds to 
a case with 1/lOth (1/lOOth) of the LOFAR noise, roughly expected for SKAl-low (SKA2-low). Lower 
panels: The ratio Cabs/Obb.s corresponding to the upper panels. 


of the instrument. On the other hand, the main problem of the paucity of high-z radio sources 
would persist. 

The attempt to detect the 21cm forest has the potential to provide unprecedented information 
about the large-scale evolution of the intergalactic medium as well as the growth of small-scale 
structures. Different information can be gleaned depending on the nature of the observation. An 
attempt to detect the presence of the 21cm forest via a statistical analysis of the power spectrum 
(Ewall-Wice et al. 2013) will allow us to put constraints on the presence of high-redshift radio- 
loud sources at high wavenumbers {k > 0.5 Mpc^^), and to study the IGM thermal history at low 
wavenumbers (k < 0.1 Mpc^^). Assuming high-redshift radio-loud sources are identified, and we 
are able to observe them directly in targeted observations, detailed study of the source spectra 
can reveal the mean thermal properties of the IGM via the flux decrement (Furlanetto 2006) or 
variance in the flux (Carilli et al. 2004; Mack & Wyithe 2012). If individual absorption features 
are identified in the spectrum, this could allow us to put constraints on the clumping factor at early 
times (Xu, Ferrarra & Chen 2011) and to map structure along the line of sight (IGM and/or the first 
collapsed structures and mini-halos), thus significantly extending our understanding of the process 
of reionization and the hierarchical growth of structure in the Universe. 

Finally, we would like to comment on the fact that as observations of the 21 cm forest require 
very high frequency resolution, they are demanding both in terms of data storage and processing. 
However, when a bright continuum radio source is identified after an all-sky survey or deep EoR 
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observations, it is possible to significantly average in time after phasing up to the source. The max¬ 
imum amount of data compression will be dictated by the need to identify and subtract bright, far 
away sources that may generate sidelobe noise in the measurement of the 21 cm forest. However, 
if bright sources are measured a priori from a sky survey they can be subtracted from the high time 
resolution visibilities and, afterwards, data can be easily compressed by an order of magnitude or 
more. 
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